Comparing spatial maps of human population-genetic variation using Procrustes analysis.
Abstract
Recent applications of principal components analysis (PCA) and multidimensional scaling (MDS) in human population genetics have found that "statistical maps" based on the genotypes in population-genetic samples often resemble geographic maps of the underlying sampling locations. To provide formal tests of these qualitative observations, we describe a Procrustes analysis approach for quantitatively assessing the similarity of population-genetic and geographic maps. We confirm in two scenarios, one using single-nucleotide polymorphism (SNP) data from Europe and one using SNP data worldwide, that a measurably high level of concordance exists between statistical maps of population-genetic variation and geographic maps of sampling locations. Two other examples illustrate the versatility of the Procrustes approach in population-genetic applications, verifying the concordance of SNP analyses using PCA and MDS, and showing that statistical maps of worldwide copy-number variants (CNVs) accord with statistical maps of SNP variation, especially when CNV analysis is limited to samples with the highest-quality data. As statistical maps with PCA and MDS have become increasingly common for use in summarizing population relationships, our examples highlight the potential of Procrustes-based quantitative comparisons for interpreting the results in these maps.
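The comparison described in the abstract can be sketched in a few lines. The abstract does not spell out the similarity statistic or the test, so the following is a minimal sketch under stated assumptions: it uses one common formulation, the statistic sqrt(1 - D), where D is the minimized, normalized sum of squared distances after translating, scaling, and rotating or reflecting one map onto the other, together with a permutation test over sample labels. The function names `procrustes_similarity` and `permutation_p_value` and the synthetic coordinates are hypothetical and are not taken from the paper.

```python
import numpy as np


def procrustes_similarity(X, Y):
    """Similarity sqrt(1 - D) between two n-by-k point configurations.

    D is the Procrustes disparity after centering both maps, scaling each
    to unit Frobenius norm, and applying the best rotation/reflection.
    (Assumed formulation; not confirmed by the abstract.)
    """
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=float)
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    Xc = Xc / np.linalg.norm(Xc)
    Yc = Yc / np.linalg.norm(Yc)
    # For unit-norm centered maps, D = 1 - (sum of singular values)^2,
    # so sqrt(1 - D) is simply the sum of the singular values.
    s = np.linalg.svd(Xc.T @ Yc, compute_uv=False)
    return float(min(1.0, s.sum()))


def permutation_p_value(X, Y, n_perm=200, seed=0):
    """Observed similarity and a p-value from permuting the rows of Y."""
    rng = np.random.default_rng(seed)
    Y = np.asarray(Y, dtype=float)
    observed = procrustes_similarity(X, Y)
    hits = 0
    for _ in range(n_perm):
        shuffled = Y[rng.permutation(len(Y))]
        if procrustes_similarity(X, shuffled) >= observed:
            hits += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return observed, (hits + 1) / (n_perm + 1)


# Synthetic illustration: a "genetic map" that is a rotated, scaled,
# shifted, and slightly noisy copy of hypothetical sampling coordinates.
rng = np.random.default_rng(1)
geo = rng.normal(size=(30, 2))
theta = 0.7
rotation = np.array([[np.cos(theta), -np.sin(theta)],
                     [np.sin(theta), np.cos(theta)]])
genetic = 2.5 * geo @ rotation + np.array([3.0, -1.0])
genetic_noisy = genetic + rng.normal(scale=0.05, size=geo.shape)

similarity, p_value = permutation_p_value(geo, genetic_noisy)
```

In practice `geo` would hold the longitude and latitude of the sampling locations and the second matrix would hold the first two PCA or MDS coordinates of the same individuals or populations, in the same row order. The same function can compare a PCA map with an MDS map, or a CNV-based map with a SNP-based map.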
Similar resources
A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations
Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-n...
Genetic structure and phenotypic diversity of two northern populations of Cheilosia aff. longula (Diptera: Syrphidae) has implications for evolution and conservation
The genetic structure and phenotypic diversity of two populations of Cheilosia aff. longula (Diptera: Syrphidae) in Lapland, Finland, were examined using DNA sequencing, protein electrophoresis, and geometric morphometrics. The morphological identification of the species was verified using partial sequences of mitochondrial cytochrome c oxidase subunit I (COI mtDNA), and the nuclear ribosomal ...
Normalization of qPCR array data: a novel method based on procrustes superimposition
MicroRNAs (miRNAs) are short, endogenous non-coding RNAs that function as guide molecules to regulate transcription of their target messenger RNAs. Several methods including low-density qPCR arrays are being increasingly used to profile the expression of these molecules in a variety of different biological conditions. Reliable analysis of expression profiles demands removal of technical variati...
Correcting Principal Component Maps for Effects of Spatial Autocorrelation in Population Genetic Data
In many species, spatial genetic variation displays patterns of "isolation-by-distance." Characterized by locally correlated allele frequencies, these patterns are known to create periodic shapes in geographic maps of principal components which confound signatures of specific migration events and influence interpretations of principal component analyses (PCA). In this study, we introduced model...
Modeling inequality levels with the help of spatial and non-spatial indices in northern Khorasan
This paper aims to explain the inequality and imbalance in the developmental levels of six selected cities in North Khorasan. It seeks to answer two questions: do spatial and non-spatial indices of regional disparity affect equality, and can a functional model be derived from the evaluation of indicators? In order to achieve the goal and the answer to t...
Journal: Statistical Applications in Genetics and Molecular Biology
Volume: 9
Publication date: 2010